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ABSTRACT 

We present models for the three-point correlation function (3PCF) of both dark matter 
and galaxies. We show that models based on the halo model can reasonably match the 
dark matter 3PCF obtained from high-resolution V-body simulations. On small scales 
(r < 0.5 ft- _1 Mpc) the 3PCF is sensitive to details regarding the density distributions 
of dark matter halos. On larger scales (r <; 2.0 ft. _1 Mpc) the results are very sensitive 
to the abundance of the few most prominent halos. Using the conditional luminosity 
function, we also construct models for the 3PCF of galaxies, which we test against 
large mock galaxy samples. The bias of the galaxy distribution with respect to the 
dark matter, and the finite number of galaxies that can be hosted by individual halos, 
significantly reduce the normalized three-point correlation function with respect to 
that of dark matter. Contrary to the 3PCF of the dark matter, the galaxy 3PCF 
is much less sensitive to details regarding the spatial number density distribution of 
galaxies in individual halos or to the abundance of the few most massive systems. 
Finally, we show that our model based on the conditional luminosity function is in 
good agreement with results obtained from the 2-degree Field Galaxy Redshift Survey. 
In particular, the model nicely reproduces the observational finding that the 3PCF for 
early-type galaxies is slightly higher than that of late-type galaxies, and that there is 
no significant dependence of the 3PCF on galaxy luminosity. 
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1 INTRODUCTION 

Understanding the formation and evolution of large-scale 
structure in the Universe is one of the most important goals 
in cosmology. Most of observationally accessible information 
comes to us in the form of (galaxy) light, and large-scale 
structure studies have therefore predominantly focused on 
analyzing the spatial distribution of galaxies. The statis- 
tical tool that is most commonly used to quantify galaxy 
clustering are the correlation functions (Peebles 1980). For 
a Gaussian density field, the statistical properties are fully 
described by the two-point correlation function (or, equiv- 
alently, by the power spectrum in Fourier space), and all 
reduced higher order correlations are zero. However, even 
though the initial perturbations in the density field out of 
which all structure formed are generally thought to be Gaus- 
sian, the present-day distribution of galaxies in the Uni- 
verse is expected to be non-Gaussian. There are two reasons 
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for this. First of all, typical density perturbations on scales 
^ 10/i _1 Mpc have already become non-linear at the present 
time. On these scales, the non-linear dynamics can create 
non-Gaussian fluctuations. Secondly, galaxies may be biased 
with respect to the underlying mass distribution, which may 
produce additional non-Gaussian features. Therefore, there 
has been much interest in studying the higher-order corre- 
lation functions of galaxies, especially the three-point corre- 
lation function (3PCF) or equivalently, its Fourier counter- 
part, the bi-spectrum (Peebles & Groth 1975; Peebles 1980; 
Jing, Mo & Borner 1991; Jing & Borner 1997; Scoccimarro 
et al. 1998; Buchalter & Kamionkowski 1999; Bernardeau et 
al. 2002; Verde et al. 2002). 

Theoretically, the 3PCF of the mass distribution in the 
Universe has been considered by various authors using ei- 
ther analytical models or JV-body simulations (e.g. Fry 1984; 
Matsubara & Suto 1994; Suto & Matsubara 1994; Jing & 
Borner 1997, 1998; Gaztanaga & Bernardeau 1998; Frieman 
& Gaztanaga 1999; Barriga & Gaztanaga 2002; Takada & 
Jain 2003). Theoretical models for the 3PCF of galaxies, 



© 0000 RAS 



2 Wang et al. 



however, are much more difficult to obtain, as they depend 
on the details about how galaxies form in the cosmic density 
field. The simplest model, in which galaxies are assumed to 
be linearly biased with respect to the mass, is obviously an 
over-simplification. 

According to current the cold dark matter (CDM) sce- 
nario of structure formation, most of the mass in the Uni- 
verse is expected to be in dark matter halos. These are 
quasi-equilibrium systems of dark matter particles, formed 
through non-linear gravitational collapse. Since accurate an- 
alytical models are now available for the mass function, spa- 
tial clustering, and density profile of the halo population 
(e.g. Mo & White 2002 and references therein), there has 
been a lot of effort in recent years in constructing the so- 
called halo model, which describes the dark matter mass dis- 
tribution solely in terms of its dark matter building blocks 
(Ma & Fry 2000; White 2001; Kang et al. 2002; Mo, Jing & 
Borner 1997; Cooray & Sheth 2002 and references therein). 
In the CDM cosmogony, galaxies and other luminous ob- 
jects are assumed to form by cooling and condensation of 
the baryons within halos (White & Rees 1978). Hence, the 
distribution of galaxies can be linked to the halo model if 
it is combined with a model for the formation of galaxies in 
individual halos. Unfortunately, the physics of galaxy forma- 
tion are still poorly understood. One way to make progress, 
without a detailed theory for how galaxies form, is to model 
the statistics of galaxy occupation numbers in dark matter 
halos. Many recent investigations have used such halo occu- 
pation models to study various aspects of galaxy clustering 
(Jing, Mo & Borner 1998; Peacock & Smith 2000; Seljak 
2000; Scoccimarro et al. 2001; White 2001; Jing, Borner & 
Suto 2002; Bullock, Wechsler & Somerville 2002; Berlind & 
Weinberg 2002; Scranton 2002; Kang et al. 2002; Marinoni 
& Hudson 2002; Kochanek et al. 2003; Zheng et al. 2002; 
Berlind et al. 2003; Magliocchetti & Porciani 2003; Yan et 
al. 2003). 

In this paper, we investigate the 3PCF of both dark 
matter and galaxies using the halo model combined with the 
so-called conditional luminosity function (hereafter CLF). 
The CLF formalism was introduced by Yang et al. (2003) 
and van den Bosch et al. (2003) as an extension of the typi- 
cal halo occupation models. It not only contains informa- 
tion about the number of galaxies per halo, but also on 
their luminosities and morphological types. Using the CLF, 
therefore, allows us to investigate how the 3PCF of galax- 
ies depends on galaxy type and luminosity. Our purpose 
is twofold. First, we use high-resolution JV-body simulations 
and realistic mock galaxy samples constructed from them to 
check the accuracy of the model predictions for the 3PCFs. 
Secondly, we construct detailed mock galaxy redshift sur- 
veys, and compare the resulting 3PCFs with those obtained 
from the 2 degree Field Galaxy Redshift Survey (2dFGRS, 
Colless et al. 2003). Earlier investigations, based on simple 
assumptions about the relation between galaxies and dark 
matter, showed that the 3PCF predicted by the current 'con- 
cordance' cosmology is significantly higher than the obser- 
vations indicate (Jing & Borner 1998, 2004). We show that, 
using our more realistic mock samples based on the CLF, 
the discrepancy between model and observation can be al- 
leviated. 

This paper is organized as follows. In Section 2, we out- 
line the halo occupation distribution (HOD) models. We 



present halo-based models for the two- and three-point cor- 
relation functions in Sections 3 and 4, respectively. Com- 
parisons between model predictions and simulation results 
are made in Section 5. In Section 6, we compare model pre- 
dictions of the redshift-space 3PCFs obtained from mock 
samples with the observational results obtained from the 
2dFGRS. Finally, we draw conclusions in Section 7. 



2 OVERVIEW OF THE HALO MODEL 

The basic idea of the halo model is to describe the evolved, 
non-linear dark matter distribution in terms of halos with 
different masses. On strongly non-linear scales, the cluster- 
ing of dark matter can be understood in terms of the actual 
density profiles of halos, while on larger, linear scales, it 
can be understood in terms of the spatial distribution of 
dark matter halos (see Cooray & Sheth 2002 and references 
therein). The halo model contains three essential ingredi- 
ents, which we review below. 



2.1 Halo mass function 

The mass function of dark matter halos, n(M)dM, describes 
the number density of dark matter halos as a function of halo 
mass. The Press-Schechter formalism (Press & Schechter 
1974) yields an analytical estimate for n(M), and we use 
the form given in Sheth, Mo & Tormen (2001): 

diner 



n{M)AM=^vf{v) 



AM, 



(1) 



dlnM 

where p is the mean matter density of the Universe, v = 
8 c /a(M), and 8 C is the critical over-density required for col- 
lapse. The quantity a(M) in the above equation is the linear 
rms mass fluctuation on mass scale M and f(v) is a function 
of v: 

1/2 



(2) 



with v' = y/av, a = 0.707, q = 0.3 and A w 0.322. 
The resulting mass function has been shown to be in excel- 
lent agreement with numerical simulations, as long as halo 
masses are defined as the masses inside a sphere with an 
average over-density of about 180 (Sheth & Tormen 1999; 
Jenkins et al. 2001). In what follows, we define the radius of 
this sphere as riso and the corresponding volume as Viso- 



2.2 Halo density profile 

The dark matter density profile, p(r) , describes the mass dis- 
tribution within individual dark matter halos (e.g., Navarro, 
Frenk & White 1997, hereafter NFW; Moore et al. 1998; 
Jing 2002; Bullock et al. 2001). We assume that p(r) has 
the NFW form 



p(r) 



Sp 



(r/r s )(l + r/r s ) 2 



(3) 



where r s is a characteristic radius, and 5 is a dimension- 
less amplitude which can be expressed in terms of the halo 
concentration parameter c = riso/r s as 



5 = 



180 



3 ln(l + c) - c/(l + c) ' 



(4) 
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Numerical simulations have shown that c is correlated with 
halo mass (NFW; Eke et al. 2001; Jing 2002; Bullock et 
al. 2001; Zhao et al. 2003a,b). We use the following c-M 
relation: 

-0.13 



c(M) = A 



M 
~M* 



(5) 



where M* is the nonlinear mass scale defined as a(M*) = S c . 
In most of our analyses we assume A — 14, but we also test 
the sensitivity of our results to changes in A. 



2.3 Halo clustering 

The number density and density profiles of dark matter ha- 
los allow one to compute the clustering properties of dark 
matter on small scales. On large scales, however, one needs 
information regarding the spatial distribution of dark mat- 
ter halos. The halo-halo two-point correlation function is re- 
lated to the mass correlation function through the so-called 
halo bias factor b (e.g., Mo & White 1996, 2002; Sheth, Mo 
& Tormen 2001). On large scale, we can write 



£hh(r;Mi,M 2 ) = 6(Mi 



)6(M 2 )C 2 d h m M ; 



(6) 



where £ 2 ™(r) is the 2-halo term of the dark matter correla- 
tion, to be specified later, and 



b{M) 



1 + 



1 



\fa8c 



y/a (av 2 ) + y/ab (av 2 ) 1 c — 
{av 2 Y 



{av 2 Y + 6(1- c)(l- c/2) 



(7) 



with a = 0.707, b = 0.5, c = 0.6 and v = S c /a(M) (Sheth, 
Mo & Tormen 2001). 

We follow a similar approach for the three-point corre- 
lation function of dark matter halos and assume that it has 
the form 

Chhh(ri2,r-23,r3i,Mi,M 2 ,M 3 ) = b(M 1 )b(M 2 )b(M 3 ) 

x <3hVi2,r 23 ,r 3 i), (8) 

where C3h n ( r i2, ?*23, r 3 i) is the 3-halo term of the 3PCF of 
dark matter (to be specified below). Note that we neglected 
the quadratic term in the relation between halo number den- 
sity and mass density, i.e. we assumed linear bias between 
halo distribution and mass distribution, which is expected 
to be valid in quasi-linear regime (e.g. Mo, Jing & White 
1997; Ma & Fry 2000; Takada & Jain 2003). We will discuss 
the impact of including this quadratic term on the 3PCF 
later in Section 5.2.4. 



2.4 Halo occupation numbers 

In order to construct a model for the three-point correlation 
function of galaxies, we need to know how galaxies populate 
dark matter halos. Here the key quantity is the halo occupa- 
tion number, (N(M)), which describes the average number 
of galaxies (with luminosities greater than some limiting lu- 
minosity) that occupy a halo of mass M. As discussed in 
Section 1, numerous studies have used such halo occupa- 
tion number models to investigate how changes in (N(M)) 
impact on the statistical properties of the galaxy distribu- 
tion. In a series of recent papers, Yang, Mo & van den Bosch 
(2003) and van den Bosch, Yang & Mo (2003) have taken this 



halo occupation approach one step further by considering 
the occupation as a function of galaxy luminosity and type. 
They introduced the conditional luminosity function (here- 
after CLF) <E>(L|M)dL, which gives the number of galaxies 
with luminosities in the range L ± dL/2 that reside in ha- 
los of mass M. Yang et al. (2004) constructed mock galaxy 
redshift surveys based on this CLF, and showed that many 
of the corresponding low-order clustering properties are in 
good agreement with the 2dFGRS observations, both in real 
and redshift space. 

With the CLF the occupation numbers can be com- 
puted as function of both luminosity and type. For exam- 
ple, the average halo occupation number for galaxies within 
a given luminosity range, L\ < L < L 2 , is, 



M{M) = / $(L|M)dL. 



(9) 



ii 



This halo occupation number can be further divided into 
early- and late- type components (e.g., van den Bosch et 
al. 2003), 



A/(M) = A/early (M) + Mate(M) . 



(10) 



or into central and satellite galaxy components (Yang et 
al. 2004; van den Bosch et al. 2004), 



M(M) = M c (M) + A4 (M) , 



(11) 



Here N c is either zero or unity, and the central galaxy is 
always located at the center of the halo. Satellite galaxies, 
on the other hand, are assumed to follow a number density 
distribution given by n s (r). In this paper, we adopt the same 
CLF as the fiducial model in Yang et al. (2004), i.e. model 
D in van den Bosch et al. (2003). 



3 HALO-BASED MODELS OF TWO-POINT 
CORRELATION FUNCTIONS 

In the halo model, the two-point correlation function for 
dark matter (and galaxies) can be decomposed into two 
parts, 



£(r)=£ lh (r)+£2h(r), 



(12) 



where £ih represents the correlation due to pairs of dark 
matter particles (or pairs of galaxies) within the same halo 
(the "1-halo" term), and £ 2 h describes the correlation due to 
dark matter particles (galaxies) that occupy different halos 
(the "2-halo" term). 

For convenience, we introduce the normalized halo pro- 
file um(c) = p(r)/M, and the normalized number density 
distribution of satellite galaxies u B (r) — n s (r)/J\f B (M), so 
that 



/ ■ 

J Vigo 



d x u M ,s(r) = 1 , 



(13) 



where Viso is the volume of the sphere defined by the virial 
radius riso- 
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3.1 Two-point correlation function for dark 
matter 

The 1-halo term mass correlation function can be calculated 
from the dark matter density distribution (Ma & Fry 2000) : 



6h (r) 



poo p 

_ 2 / dMn(M) / d 3 x p(x)p{\x + r\) 
P Jo J V 1S0 

= — / AMn(M)M 2 f M (r) (14) 
P Jo 

where /m (r) is the particle pair distribution function within 
a dark matter halo of mass M: 



fM(r) — 2n / hm(s)s ds / wm(|s + r|) sin# dO , (15) 
Jo Jo 

with |s + r| = (s 2 + r 2 + 2sr cos 6>) 1/2 . 

Formally, one can write the 2-halo term of the dark 
matter 2PCF as 

1 poo poo 

&h{r) = — / dM 1 n(M 1 )M 1 / dM 2 n(M 2 )M 2 
P Jo Jo 
£hh(r;Mi,M a ). (16) 

The halo-halo correlation function can be directly related 
to the non-linear 2PCF ^N™( r ) of the dark matter by tak- 
ing account of halo-halo exclusion and of the fact that dark 
matter halos are biased tracers of the mass: 

£hh(r;Mi,M 2 ) = b(M 1 )b(M 2 )U(r,M 1 )U(r,M 2 )^(r) , (17) 

where 



«**>-{? 'L r<rm(M) 

Thus, we can write 

/■dm/ \ T fdm/ \1 ^ /-dm/ \ 

&2h (r) = [/excMJ CnlM , 
where 

1 



/dm / \ 
BXc(r) = 



n(M) M b(M) U{r, M) AM . 



(18) 



(19) 



(20) 



We compute £nl(t) from the Fourier transform of the non- 
linear power spectrum Pnl(A;) given by Smith et al. (2003). 
As we show below, the choice of r oxc (M) can significantly 
affect the amplitude of the 2PCF on intermediate scales (r ~ 
2 7i _1 Mpc). Unfortunately, this effect is difficult to model 
from first principle. What we will do is to tune its value 
so that the model prediction for the 2PCF best matches 
simulation results. Note that when we consider the 2-halo 
term of the 2PCF, we do not take into account the impact 
of the halo profile. This effect is negligible compared to the 
halo-halo exclusion effect. 



3.2 Two-point correlation function for galaxies 

As for the dark matter, we split the 2PCF for galaxies in a 
1-halo and a 2-halo term. In order to model the 1-halo term, 
we need to specify the distribution of galaxies in individual 
halos. As stated above, we assume that the central galaxy 
is located at the halo center, and that the satellite galaxies 
follow a normalized number density distribution given by 
u s (r). For simplicity, unless specifically stated otherwise, we 
assume that u s (r) = um{t), i.e., that the number density 



of satellite galaxies is the same as that of the dark matter 
particles within the halos. 

We can write the 1-halo term of the galaxy correlation 
function as 



e ih (r) = / n{M) (ATp air (M)) f(r) dM , 



(21) 



where (iV pa i r (M)} is the mean number of pairs in halos of 
mass M, f(r)4irr 2 Ar is the fraction of pairs with separation 
in the range r ± Ar/2, and n g is the mean number density 
of galaxies given by 



poo 

I n(M)M(M) dM . 
Jo 



(22) 



The mean number of pairs as function of separation, 
(-/Vp a i r }/(r), can be divided into contributions from central- 
satellite pairs and satellite-satellite pairs: 



(A r pair)/(r) = {N ca )u s (r) + (N ss )f s (r) , 



(23) 



where f s (r) follows from eq. (15) upon substituting u s for 
mm- The number of central-satellite pairs is 



(ATcs) =Mc{M)M s {M). 



(24) 



Since (N ss ) depends not only on the mean occupation 
J\f B (M), but also on the second moment, we adopt the near- 
est integer model in which N S (M) has the probability of 
N + 1 - N S (M) to take the value N and the probability of 
K{M)-N to take the value N + l, if N < M B {M) < N + l. 
In this case, the mean number of satellite-satellite pairs is 



(N SS )=NK(M)--N(N + 1). 



(25) 



The 2-halo term of the 2PCF for galaxies follows from 
cq. (16) upon substituting n g for p and Af(Mi) and M(M 2 ) 
for Mi and M 2 , respectively. This yields 



e 2h (r) = [/ c g xc (r)] 2 CnlM , 
whore 



(26) 



/ c s xc(r) = ^/ n(M)N{M)b{M)U{r,M)dM. (27) 
n s Jo 

As for the dark matter, we consider r cxc a free parameter 
which we tune to best match the 2PCF of our mock galaxies. 



4 HALO-BASED MODELS OF THREE-POINT 
CORRELATION FUNCTIONS 

The three-point correlation function C( r i2, r 2 g, r^i) in real 
space is defined through the probability dPi23 of finding 
one particle simultaneously in each of the three volume el- 
ements dVi, dV 2 and dV-j that are located at n, r 2 and r3, 
respectively. By definition, this probability is related to the 
3PCF as 

dPm = [l+£(ri2)+£(r23)+£(r 3 i) + C(n2,r23,r-3i)] 
xn s dV 1 dV 2 dV 3 , (28) 

where nj = |r» — Tj\ and n is the mean number density of 
particles (Peebles 1980). It is common practice to express 
the 3PCF in the so-called normalized form, 



Q(r,u,v) = 



C(ri 2 ,r 23 ,r3i) 



£(ri 2 )£(r 2 3) + £(r23)£0"3i) + £(r 3 i)£(ri 2 ) 



(29) 
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Figure 1. The 2PCFs for dark matter (left panel) and mock galaxies with luminosity — 5 log h < —18.5 (right panel). The circles 
in the left panel are the results obtained from one 300 h~ 1 Mpc box simulation; the circles with error-bars in the right panel are the 
mean and 1-cr error obtained from four simulations. The thin dot-dashed, thin doted and thick dot-dashed curves in the left panel are 
the 1-halo term, 2-halo term and total halo model predictions for the fiducial dark matter 2PCFs. In this fiducial model, we adopted a 
concentration normalization A = 14 and a mean halo-halo exclusion radius r cxc = 1.5 fiso- The short and long dashed lines in the left 
panel correspond to the 2PCFs for A = 9 and r oxc = 2.0 riso models, respectively. For comparison, the fitting result of Smith et al. (2003) 
is shown as the solid line. The fiducial model predicts a 2PCF that is consistent with both the simulation and Smith et al. results. In the 
right panel, we compare the model prediction and simulation results for galaxies. While for the galaxy 2PCFs, the model with a mean 
halo-halo exclusion radius r oxc = 2.0 rigo consists with the simulation results extremely well (see the text for details). 



where, following Peebles (1980), the new variables, 

f31 — 7*23 



ri2, 



7~23 
T12 ' 



ri2 



(30) 



describe the shape (u and v) and size (r) of the triplet with 
sides ri2 < »"23 < r 3 i. If Q(r,u,v) is constant, the 3PCF is 
said to have the 'hierarchical form', i.e. 

C(n, r 2 , r 3 ) oc [£(n)£(r 2 ) + £(r 2 )£(r 3 ) + £(r 3 )£(n)] . (31) 

Following the approach for the 2PCF, we write the 
3PCF as the sum of the '1-halo', '2-halo' and '3-halo' terms: 

C(ri2, J"23,r 3 i) = Cih(ri2,r 2 3, r 3 i) + C2h(ri2,r23,r 3 i) 

+ C3h(ri2,r23,r 3 i) , (32) 

where ri 2 , r 2 3, and r 3 i are the lengths for the three edges 
of the triplet. Without loosing generality, we assume ri2 < 
7*23 < ?"3i • In the remainder of this section, we present models 
for the various terms for both dark matter particles and 
galaxies. 



4.1 The three-point correlation function for dark 
matter particles 

Following the method described in Takada & Jain (2003), 
we write the 1-halo term of the 3PCF for dark matter as 

Ci d h m (ri2,r 2 3,r 31 ) = ^ J AM n{M) M 3 ffM (l,2,3), 



(33) 



where 
<?m(1,2,3) 



p r 

Jo 

" f 

Jo 



um{s) s ds / um(|s + ri2 1) sin0< 
Jo 



u M (\s + r 13 \) dip . 



(34) 



Here |s + ri 2 | = (s 2 + r\ 2 + 2sri2 cos O) 1 ^ 2 and |s + ri 3 | = 
(s 2 + rf 3 + 2sri3 cos^i) 1 / 2 . The angle 6\ is given by 



cos 9i = sin a sin 9 cos ip + cos a cos 6 , 
with 



a = cos 



rf 2 + rf n - ri 3 



2r 12 r 31 



(35) 



(36) 



The 2-halo term in the 3PCF of dark matter can be 
written as 

^2 d h m (r 2 3)+?2 d h m (r-3l) 



Cam / \ 
2h (ri2,r 2 3,r 3 i) 



dm / \ k 

= £rh (ri2)- 
+ perm(l, 2, 3) , 



(37) 



where perm(l,2,3) is the permutation of the three points, 
and 



dm / \ 

£ih (r) 



1 



dMn(M) M 6(M)/ M (r). 



(38) 



with fyi(r) defined in eq. (15). 
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Figure 2. The 2PCFs for mock galaxies with different models 
for the satellite distribution in halos. The solid line is the fidu- 
cial model, where satellite distribution in a halo follows the NFW 
with the concentration normalization A = 14. The dotted line 
corresponds to result where the number of galaxies in a halo has 
a Poisson distribution with the mean given by the mean halo oc- 
cupation number. The short dashed line (FOF) is the result in 
which satellite galaxies are traced by dark matter particles in the 
FOF halos that host them. The long dashed line shows the re- 
sult with the concentration normalization A = 9. The dot-dashed 
line is the result where the number of satellite galaxies in a halo 
follows a Poisson distribution with the mean given by the mean 
occupation number of satellite galaxies. Due to the simulation 
resolution, only galaxies with Mj, 7 — 51og/i < —18.5 are used. 

Similar as with the 2-halo term in the 2PCF, we can 
use the halo exclusion principle to write the 3-halo term of 
the 3PCF in terms of the non- linear, dark matter 3PCF, Cnl 
(cf. eq. [19]). However, no accurate theoretical model for 
that can cover both small and large scales is currently avail- 
able. We therefore use the quasi-linear 3PCF, Cnl, instead, 



and write 

C 3 h (7*12, 7*23, 7*31 ) 



T /"dm/ \ rdm / \ rdm 
|/exc (ri2) /exc (f 23) /exc 



(t* 3 i)] 



CQL(fi2,r 23 ,r 3 i) . 



(39) 



Note that the quasi-linear 3PCF is only applicable at large 
scales. However, since the number of triplets on small, highly 
non-linear scales is expected to be dominated by the 1-halo 
and 2-halo terms, the use of for to compute the 3-halo 
term should be sufficiently accurate. The quasi-linear 3PCF 
has been obtained from perturbation theory (e.g. Fry 1984; 
Matsubara & Suto 1994; Jing & Borner 1997; Barriga & 
Gaztafizga 2002), and can be written as 

Cql(7*12,7*23,7* 3 i) = y £(7*12) £(t* 23 ) + 



+ 



<j>'(r 12 )<f>'(r 2 3) C(7-i2)0 / (t-23) £(r 23 )(ft'(ri2) 



£(ri 2 ) + 3 



7*127*23 

0'(7*12) 



7*23 



7*12 



7*12 



£(»*23) + 3 



4>'(r 23 ) 



7*23 



-li [£' (7*12)0' (7*23) + £' (7*23)0' (7*12)] + perm (1, 2, 3) , (40) 



where fj, = (n 2 • r 32 )/(ri 2 r 2 3), 
Ph(k) sin(fcr) 

4>(r) - 



— r 

Wo 



k 2 



kr 



k 2 dk , 



(41) 



with Ph{k) the linear power spectrum, <j>'(r) = A(f>/Ar, and 
perm (1,2, 3) the permutation of the three points of the 
triplets. iV-body simulations show that this formula is a 
good approximation on scales 7*12 ii 6 /i _1 Mpc (Jing & 
Borner 1997; Barriga & Gaztanaga 2002). Since we are inter- 
ested in the reduced 3PCF Q(ri2, 7*23, 7*31), which we define 
to be the 3PCF normalized by the square of the nonlinear 
2PCF £nl(t*), we made a modification in equation (40) by 
replacing £(r) with Cnl(t')- 

4.2 The three-point correlation function for 
galaxies 

As for the dark matter, we write the 1-halo term in the 
3PCF of galaxies as 



Cfh (7*12, 7- 23 , r 3 i) 



_6_ r 

"2 Jo 



AM n(M) (JVtripiet) 5(1,2, 3) ,(42) 



where (iVt r ipiet) is the average number of triplets per halo of 
mass M and g(l, 2, 3) is the fraction of triplets in a particular 
configuration. Note that both (A*t r i P iot) and p(l,2,3) may 
depend on halo mass M. The triplets can be divided into 
central-satellite-satellite and 3-satellite triplets: 

(iV trip iet)s(l, 2, 3) = (iV css ) Sc (l, 2, 3) + (iV sss ) ffs (l, 2, 3) , (43) 

where g s (l, 2, 3) follows from eq. (34) upon substituting mm 
with m s , and <? c (l, 2, 3) is given by 



flc (l, 2, 3) = - [Ws(7*i2) M s (r 23 ) + perm(l, 2, 3)] . 



(44) 



Using the same sampling algorithm as described in Section 
3.2 ( N S (M) has the probability of N + 1 - Af a (M) to take 
the value N and the probability of Af B (M) — N to take the 
value N + 1, if TV < M B {M) < N + 1), wc can write the 
number of central-satellite-satellite triplets, (N caa ), as 

<iV css ) =A/" c (M)<iV ss >, (45) 

and the number of 3-satellite triplets, (N sss ), as 

(46) 

The 2-halo term in the 3PCF for galaxies is similar to 
that of dark matter particles, and can be expressed as 

^! h (r 2 3)+£f h (7*3l) 



Cfh ( r 12, r 2 3 , 7*31 ) 



where 



= 4>12)" 

+ perm(l,2,3), 



ih M = zj? / dM n ( M ) KM) <iV pair ) f(r) , 



with f(r) given by eq. (23) and 
1 f°° 

b=— n(M) N{M) b(M) AM . 
n s Jo 



(47) 



(48) 



(49) 



Finally, for the 3-halo term of the galaxy 3PCF we write 



Cfh (7*12, 7*23, 7*3l) 



[/cxc(7*l 2 ) /c S xc(7* 2 3) /lxc(7*3l)] 
Cql (7*12, 7" 2 3, r 3 l) . 



(50) 
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Figure 3. The normalized 3PCFs of equilateral triangles for dark matter as a function of the size of the triangle. The thick and thin 
curves show the total and 1-, 2- and 3-halo contributions to the 3PCF, respectively. The solid and dashed lines in the left panel are 
results for dark matter with different concentration normalization A. We compare model predictions with simulation results in the right 
panel. Squares with error bars are simulation results. The thick solid curve is the same as that in the left panel, while the dot-dashed 
lines are the results that assume the mass function in the model is truncated at M = 2 X 10 15 Ii~ 1 'Mq to mimic the incompleteness in 
the simulations. 



5 COMPARISON WITH SIMULATIONS 

The analytical halo models for the two- and three-point 
correlation functions of dark matter and galaxies are de- 
rived based on a number of simple assumptions, and so 
their validity needs to be checked. In this section, we use 
high-resolution numerical simulations and mock galaxy dis- 
tributions (hereafter MGDs), to test the accuracy of these 
models. 

The set of iV-body simulations used here were car- 
ried out by Y.P. Jing and Y. Suto (see Jing 2002; Jing 

6 Suto 2002) on the VPP5000 Fujitsu supercomputer of 
the National Astronomical Observatory of Japan using a 
vectorized-parallel P 3 M code. The set consists of a total of 
six simulations, each of which used N = 512 3 particles to 
evolve the distribution of dark matter from an initial red- 
shift of z — 72 down to z — in a ACDM 'concordance' 
cosmology (fi m = 0.3, D.a = 0.7, H = 70 kms -1 Mpc -1 , 
and as = 0.9). All simulations consider boxes with peri- 
odic boundary conditions. Two of them use boxsize Lbox = 
lOOh -1 Mpc, with a force softening length of ~ 10/i _1 kpc, 
while the other four simulations have Lbox = 300/i _1 Mpc, 
with force softening length ~ 30/i _1 kpc. Different simula- 
tions with the same box size are completely independent 
realizations and are used to estimate uncertainties due to 
cosmic variance. The particle masses are 6.2 x 10 8 /i _1 M 
and 1.7 x 10 10 /i _1 Mq, for the small and large box simula- 
tions, respectively. 

The mock galaxy distributions (MGDs) are obtained by 
populating dark matter halos in these iV-body simulations 



with galaxies according to the conditional luminosity func- 
tion (CLF) model. The construction of such MGDs is de- 
scribed in Yang et al. (2004) and van den Bosch et al. (2004) , 
and we refer the reader to these papers for details.. We 
use the same CLF model as in Yang et al. (2004), which 
yields excellent fits to the observed LFs and the observed 
correlation lengths as function of both luminosity and type. 
The satellite galaxies in the MGDs are assumed to be dis- 
tributed within ri8o and with a number density distribution 
that follows the density distribution of the dark matter (i.e. 
« s (r) = tiiu(r)). In this section we only use the 300/i _1 Mpc 
simulations. As discussed in Yang et al. (2004), because of 
the finite resolution of these simulations, the galaxy popula- 
tion in these MGDs are only complete down to an absolute 
magnitude limit of Mb , — 5 log ft w —18.5. 



5.1 Two-point correlation functions 

The left-hand panel of Figure 1 plots the dark matter 2PCFs 
obtained from the analytical halo model described above 
(various lines) and from the numerical simulations (open 
circles). The solid line corresponds to the nonlinear 2PCF 
calculated using the Smith et al. (2003) fitting formula, and 
is in good agreement with the simulations. The dot-dashed 
line indicates the predictions for our fiducial model, where 
we have assumed A = 14 and r cxc = 1.5 riso- With these 
assumptions, the predicted 2PCF matches both the Smith 
et al. model and the simulation results extremely well. The 
dashed and dotted curves show the corresponding contribu- 
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tions of 1-halo and 2-halo terms, respectively, and are shown 
for completeness. We also plot the results for different halo 
concentrations (A — 9) and for a different exclusion radius 
(r cxc = 2.0 riso). The former predicts a significantly lower 
1-halo term, while the latter predicts a lower 2-halo term on 
intermediate scales (r ~ 2 h~ Mpc). 



The right panel of Figure 1 shows the 2PCFs for galax- 
ies with luminosity Mbj — 51og/i < —18.5 where we set 
u s (r) = Mivi(r), i.e., the number density of galaxies follows 
the density distribution of the dark matter. The lines are 
model predictions, while the open circles with error-bars 
indicate the mean and variance obtained from four inde- 
pendent MGDs. This time the mean halo exclusion radius 
that best matches the results obtained from the MGDs is 
r C sc = 2.0 riso- Although in good agreement with Maglioc- 
chetti & Porciani (2003), this radius is slightly larger than 
that used for the dark matter. If we use the same exclusion 
radius as for the dark matter, r oxc = 1.5 riso, the 2PCF of 
galaxies is slightly over-estimated on scales r ~ 2 h~ 1 Mpc. 
The fact that the model predictions are sensitive to the 
choice of the value of r cxc suggests that the predicting power 
of current halo-based models are somewhat limited around 
the transitional scale between the 1-halo and 2-halo compo- 
nents. 



Using very similar MGDs, Yang et al. (2004) examined 
how the 2PCF of galaxies depends on the spatial distribu- 
tion of satellite galaxies within individual dark matter ha- 
los. Here for completeness and to compare with such depen- 
dence in the 3PCF to be discussed later, we calculate the 
2PCFs for different u B (r) and different sampling of galaxies 
in the dark matter halos. The results are shown in Fig 2. 
The solid line shows the 2PCF for the fiducial model where 
u s (r) = um (r) with A — 14. For comparison, the long dashed 
line shows the results with A — 9. As expected, less concen- 
trated distributions of satellite galaxies result in reduced 
correlation functions on small scales. The dotted line corre- 
sponds to a model in which the number of galaxies in a halo 
is drawn from a Poisson distribution with the mean given 
by the mean occupation number. This results in a signifi- 
cant increase of the correlation power on small scales, and 
is a consequence of the fact that the second moment of the 
Poisson distribution is larger than that of the nearest integer 
distribution adopted in our fiducial model (see also Benson 
et al. 2000; Berlind & Weinberg 2002; Yang et al. 2003). 
In a recent study, Kravtsov et al. (2004) found that, while 
each halo may contain a central galaxy, the number of satel- 
lite galaxies follows a Poisson distribution. For comparison, 
we show the result of the MGD thus generated in Fig 2 
as the dot-dashed line. This result of this model agrees ex- 
tremely well with that of the the fiducial model. Finally, the 
short-dashed line indicates the 2PCF for a model in which 
each satellite galaxy is assigned the position of a randomly 
selected particle from the friends-of-friends (FOF) group as- 
sociated with the halo under consideration. Note that this 
yields a £ (r) that is in excellent agreement with our fiducial 
model. Overall, the spatial distribution of satellite galaxies 
only has a small effect on the 2PCF, and only on small scales 
(r < 0.5 /i _1 Mpc). 
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Figure 4. The normalized 3PCFs for dark matter. Symbols with 
error-bars are simulation results; lines are the halo model predic- 
tions. 



5.2 The three-point correlation functions 

To compute the 3PCFs of dark matter particles and mock 
galaxy distribution we compare the counts of triplets with 
those of randomly distributed points: 



C(ri2,r-23,f3i) = 



NlDDD(r 12 ,r 2 3,r3i) 
iVg RRR(r 12, r 2 3,r 3 i) 

-£(ri 2 ) - £(7-23) - f(r 3 i) 



(51) 



Here DDD and RRR are the triplet counts with separations 
in the ranges ri2 ± Ari2 /2, r 2 z ± Ar 2 3 /2, and r 3 i ± Ar 3 i /2, 
in the data (D) and random (R) samples, respectively, and 
Ad and JVr correspond to the total number of objects in 
each sample. 

We compute the normalized 3PCFs, Q(r,u,v), for dark 
matter particles and mock galaxies, using equal logarith- 
mic bins for r, with Alogr ~ 0.05, and equal linear bins 
for v and u, with Av = An = 0.1. For the dark matter, 
we use four subsamples of about 500,000 particles, each se- 
lected from one of the four realizations of the 300ft _1 Mpc 
box simulations to estimate DDD. For the MGDs, there 
are about 470,000 galaxies with absolute magnitudes — 
5 log h < —18.5 in each of the four mock samples. We use 
two sets of random samples to estimate RRR. The first con- 
tains 800,000 random points in the simulation box and is 
used to estimate the number of triplets with 5 h~ Mpc < 
r 3 i < 20 h~ Mpc. The second contains 10 times as many 
points and is used to estimate the counts of triplets with 
r 3 i < 5 /i _1 Mpc. This ensures that RRR > 200 for all 
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Figure 5. The same as Figure 3, but here results are shown 
for mock galaxies. Note that the Q £q (r) of galaxies is extremely 
different from that of dark matter particles on small scales (r < 
O.S/i^Mpc). 



triangle configurations of interest. In what follows, unless 
specifically stated otherwise, we adopt r oxc = 1.5riso for 
dark matter particles and r cxc = 2.0ri8o for galaxies, and 
set A — 14 and u B (r) — umW- 
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Figure 6. The normalized 3PCFs for mock galaxies with M^. — 
51og/i < —18.5. Symbols with errorbars are simulation results; 
lines are the halo model predictions. 



5.2.1 The 3PCF for dark matter 

We first consider the normalized, dark matter 3PCF of equi- 
lateral (u = 1, v = 0) triangles, Q cq (r). The left-hand panel 
of Figure 3 plots Q aq (r) obtained using our fiducial model 
with A — 14 and r cxc = 1.5ri8o- As shown in Fig. 1 this 
normalization of the halo concentrations yields the best-fit 
to the 2PCF of the dark matter. For comparison we also 
show results for a model with A — 9. As expected, reducing 
the halo concentration signicantly lowers the value of Q cq 
on small scales. 

The right-hand panel of Fig. 3 compares the model pre- 
dictions with the simulation results. The open squares with 
errorbars are the mean and 1-a variance of the 3PCFs for 
dark matter. On intermediate scales of r ~ 2 /i~ x Mpc the 
halo model predicts a significantly higher Q cq than for the 
numerical simulations. Since the 1-halo term of the 3PCF 
Cih oc M 3 , one expects Q eq (r) to be extremely senstive to 
the abundance of the most massive haloes. Due to the lim- 
ited volume probed by the Af-body simulations, there is a 
maximum halo mass above which the halo mass function is 
no longer properly sampled. For the simulations used here, 
we estimate this mass to be 2 x 10 15 /i _1 Mq. If we include 
an artifical cut-off at this mass scale in our theoretical mass 
function, we obtain the 3PCF shown by the dot-dashed 
curve in the right-hand panel of Fig. 3. In this case, the 
model prediction matches the simulation results extremely 
well. This clearly demonstrates that the three-point corre- 



lation function on intermediate scales is extremely sensitive 
to the abundance of massive haloes. 

Fig. 4 shows a comparison of model predictions and 
simulation results for non-equilateral triangle configura- 
tions. Symbols with errorbars correspond to the Q(r,u,v) 
obtained from the numerical simulations, with lines indi- 
cating the predictions from our fiducial model, where we 
have included an upper limit to the halo mass function of 
M = 2 x 10 15 h~ 1 M Q . Without this limit, the model predic- 
tions are higher by about 20%. On small scales, Q(r,u,v) 
depends weakly on r and v, but quite significantly on u. On 
large scales (r > 3/i _1 Mpc), Q(r,u,v) increases significantly 
with increasing v, which is consistent with previous studies 
(i.e. Jing & Borner 1997; Barriga & Gaztafiaga 2002). Over- 
all, the model predictions agree remarkable well with the 
simulation results on almost all scales and for all triangle 
configurations considered, indicating that the halo model, 
as presented here, is well suited to describe the 3PCF. 



5. 2. 2 The 3PCF for galaxies 

Next we focus on the normalized 3PCF for galaxies. As 
for the dark matter, we first consider the equilateral tri- 
angle configuration. The model predictions and MGDs re- 
sults are compared in Fig. 5. The solid and dashed lines 
are results obtained by assuming A = 14 and A — 9 for 
satellite galaxies (with r cxc = 2.0 riso and no truncation in 
the halo mass function), respectively. The dot-dashed curves 
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Figure 7. The normalized 3PCFs for mock galaxies with various assumptions about galaxy distribution in individual haloes (see text 
for details). The line styles are the same as in Fig. 2. 



are model predictions where an upper limit on halo mass, 
M = 2x 10 15 /i _1 M Q is included. The open squares with 
errorbars are the mean and 1-a errors obtained from 4 inde- 
pendent MGDs. As one can see, the model predictions match 
the simulation results extremely well. Comparing these re- 
sults with the corresponding results for dark matter, we see 
that the normalized 3PCFs for galaxies on small scales are 
much smaller and with a weaker dependence on A. And un- 
like for dark matter, the truncation in the halo mass func- 
tion does not have a significant impact on the normalized 
3PCF of galaxies. There are several reasons for these results. 
Firstly, haloes that host less than three galaxies do not con- 



tribute to the 1-halo term in the 3PCF of galaxies on small 
scales, and so the 3PCF on small scale is reduced. For the 
same reason, the effect of changing A is reduced, because 
the strong dependence of Q on A for the mass on small 
scales is due to low-mass haloes. Finally, the 3PCF of galax- 
ies is less sensitive to the truncation of the mass function, 
because the halo occupation number of galaxies in massive 
haloes increases roughly as M ' 8 (Yang et al. 2004). 

Fig. 6 shows normalized 3PCFs, Q(r,u,v), for galaxies 
with absolute magnitudes M bj — 51og/i < —18.5 and for 
non-equilateral triangle configurations. Lines and symbols 
correspond to model predictions and results obtained from 



© 0000 RAS, MNRAS 000, 000-000 



The Three-point Correlation Function of Galaxies 11 



the MGDs, respectively. Overall the agreement is satisfac- 
tory. Compared to the Q(r,u,v) of dark matter particles, 
the normalized 3PCF of galaxies, has a similar form, but 
with systematically lower amplitudes on small scales due to 
the reasons given above. 

In Section 5.1, we compared the 2PCFs for MGDs us- 
ing different models for the spatial distribution of satellite 
galaxies. Here, we make a similar comparison for the 3PCF. 
The results are shown in Fig. 7. Compared with the fidu- 
cial, A = 14 model (solid lines), the A — 9 model predicts 
Q(r,u,v) that are lower on small scales (r < 0.35 ft _1 Mpc), 
consistent with the model predictions shown in the left-hand 
panel of Fig. 5. The short-dashed lines indicate the results 
obtained when assigning satellite galaxies the position of 
a randomly selected particle from the FOF group associ- 
ated with the halo under consideration. Comparing these 
3PCFs on small scales with those from our fiducial model, 
in which we assume spherical NFW distributions, gives an 
idea as to how sensitive the 3PCF is to non-sphericity of 
the spatial distribution of galaxies within haloes. Although 
the Q(r, u, v) based on FOF satellites are slightly lower than 
those of our fiducial model, the differences are small (<, 20 
percent), suggesting that the assumption of spherical haloes 
does not lead to large systematic errors. The dotted curve 
shows the normalized 3PCFs obtained using Poisson sam- 
pling. Contrary to the 2PCF, the normalized 3PCF is not 
very sensitive to the second-order moment of the halo occu- 
pation numbers. Finally, the dot-dashed lines are the results 
for the MGD in which the number of satellite galaxies in 
each halo is assumed to follow a Poisson distribution. The 
values of Q(r, u, v) given by this model is quite similar to 
that given by the Poisson sampling model. 

It is somewhat surprising that Q(r,u,v) is quite insen- 
sitive to all these changes tested. The reason might be that 
these changes affect the 2PCF and 3PCF in a similar way, 
so that the effect is compensated in the normalized quantity, 
Q(r,u,v). 

5.2.3 Dependence on galaxy type and luminosity 

Since the CLF models used here contain information regard- 
ing both galaxy type and luminosity, we can investigate how 
the 3PCF depends on these quantities. 

Fig. 8 shows the Q(r, it, v) obtained for early- (left- 
hand panels) and late-type (right-hand panels) galaxies us- 
ing both the halo model (lines) and the MGDs (symbols). 
Overall, Q(r,u,v) for early-type galaxies is systematically 
higher than for late-type galaxies, and both galaxy types re- 
veal a different dependence on the shape of the triplet. The 
higher amplitude on small scales for the early-type galaxies 
reflects the fact that early-type galaxies in our mock sample 
are preferentially located in clusters. The 3PCFs for galax- 
ies of different types have been discussed in Takada & Jain 
(2003), who adopted the halo occupation models for red and 
blue galaxies of Scranton (2002). Contrary to our results, 
they found that red galaxies have smaller Q(r, u,v). Note 
that the CLF model and MGDs used in our study have 
been compared carefully with various observations. There- 
fore, we are confident that our halo occupation models are 
more accurate. 

We also investigate the luminosity dependence of the 
3PCF. Figure 9 plots the normalized 3PCFs for galaxies in 
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Figure 8. The normalized 3PCFs for early- and late-type galax- 
ies. 



two different luminosity ranges: —19.5 < Mb r — 51og/i < 
-18.5 and -20.5 < M bj - 5 log h < -19.5. Note that the 
luminosity dependence of the normalized 3PCFs is quite 
weak. This is due to the fact that a large fraction of rel- 
atively bright spiral galaxies are isolated central galaxies of 
galaxy-sized haloes. 

Unfortunately, we can not test the luminosity depen- 
dence of the 3PCF for even brighter galaxies. For galaxies 
with M b j - 5 log h < -20.5, for which the 2PCF is much 
stronger than for fainter galaxies (Yang et al. 2004), the 
number of galaxies is too small to give a reliable estimate of 
the normalized 3PCF. 



5.2.4 The impact of the quadratic bias term 

As mentioned in Section §2.3, we have neglected the 
quadratic term of the halo bias relation. Here we use simple 
considerations to assess the impact of this term on the pre- 
dictions of the normalized 3PCFs. As a simple approxima- 
tion, the normalized 3PCF of galaxies on quasi-linear scales 
(where 5 maS s <C 1) can be related to that of dark matter as 
follows, 

1 62 

Qg = |j Qmass + ^ (52) 

(Kayo et al. 2004) . Here b is given by eq. 49 and 
1 f°° 

b2 = ^ n(M)N(M) & 2 (M) AM , (53) 
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Figure 9. The normalized 3PCFs for mock galaxies in two abso- 
lute magnitude bins. The range of JW5 — 5 log h covered by each 
bin is marked in the two upper panels. 



with 62 (M) being the quadratic term of the halo bias (Mo, 
Jing & White 1997; Scoccimarro et al. 2001). Thus including 
the quadatic term on Q g is to add a term 62 /b 2 on large 
scales. To get an idea how big this term is, we estimate the 
average of this quantity using our MGDs discussed in this 
section. For the samples of all, early-type, late-type, bright, 
and faint galaxies discussed above, the values of 62 /b 2 are 
-0.16, -0.05, -0.29, -0.12, and -0.22, respectively. If this 
term is taken into account, the overall agreement between 
model prediction and simulation results may be improved 
slightly on large scales. 



6 COMPARISON WITH OBSERVATIONS 

6.1 The 2dFGRS and mock galaxy redshift 
surveys 

We use the final public data release from the 2dFGRS, 
which contains about 250, 000 galaxies with redshifts and 
is complete to an extinction-corrected apparent magnitude 
of bj w 19.45. The survey covers an area of ~ 1500 square 
degrees selected from the extended APM Survey (Maddox 
ct al. 1996). The survey geometry consists of two separate 
declination strips in the North Galactic Pole (NGP) and 
the South Galactic Pole (SGP), respectively, together with 
100 2-degree fields spread randomly in the southern Galac- 
tic hemisphere. In this paper, we will use galaxies in the 
NGP and SGP to estimate the apparent-magnitude limit 



redshift space 3PCFs. Only those galaxies with redshift 
0.01 < 2 < 0.2, spectra quality q > 3, and redshift com- 
pleteness > 0.7 are considered. 

In order to carry out a proper comparison between 
model and observations, we construct mock galaxy redshift 
surveys (hereafter MGRS) with the same selection criteria 
and observational biases as in the 2dFGRS. We follow the 
procedure used in Yang et al. (2004) and stack various sim- 
ulation boxes (of different sizes) together to sample a suf- 
ficiently large volume and with sufficient resolution. This 
allows us to construct MGRSs with the same depth (in red- 
shift) as the 2dFGRS and with full sampling of the luminos- 
ity function down to Mbj — 5 log ft = —13.5. Observational 
selection effects in the 2dFGRS, such as position dependent 
magnitude limits, position and magnitude dependent com- 
pleteness, errors in magnitude and redshift, are all taken into 
account (see Yang et al. 2004 and van den Bosch et al. 2004 
for details) . Using the full set of numerical simulations avail- 
able to us, we construct eight independent MGRSs which we 
use for comparison with the 2dFGRS. 

6.2 The redshift-space 3PCF of galaxies 

We calculate the 3PCFs in redshift-space using the same 
method as described in Section 5.2, except that redshifts of 
galaxies are used as distances in calculating the number of 
triplets. To estimate the number of triangles for the random 
distribution, i.e. in estimating RRR, we use random samples 
that are 32 times as dense as the observational sample for 
s < 2.0 h _1 Mpc (here s is the length of the shortest side 
of the triplet in redshift space). For 2.0 < s < 5.0 ft _1 Mpc, 
the random sample is 8 times as dense as the observational 
sample, while for s > 5.0/i _1 Mpc, the random samples have 
the same density as the observational sample. 

We use the same method as in equations (51) and (29) 
to calculate the normalized redshift-space 3PCF, Q(s,u,v). 
In order to take account of observational selection ef- 
fects, each triangle is weighted by [wi (si2)u>2(s23)u>3(s3i) + 
wi(si3)w 3 (s32)w2(s2i)]/2, where 



Wi(Sij) 



1 + &im(zi)Jz(sij) 



(54) 



Here n(z) is the density of galaxies as a function of red- 
shift and J 3 (s) = ^{x)x 2 dx, where we follow Hawkins et 
al. (2003) by adopting £(s) = (s/13) -0 75 . 

The overall normalization in equation (51), i.e. the ra- 
tio between Nr and No, is estimated by summing over ran- 
dom points and galaxies, each weighted by l/n(zi). For the 
two-point correlation function used in Q(s,u, v), we use the 
estimator proposed by Hamilton (1993), 

DP x RR 

= DIP ' (55) 

where DD is the sum of galaxy-galaxy pairs with separation 
s, RR and DR are the sums of random-random and galaxy- 
random pairs with the same separation, respectively. Each 
pair is weighted by Wi(sij)wj(sij). 

6.3 Results 

We first consider the normalized 3PCFs of equilateral trian- 
gles, Qeq(s)- The 2dFGRS results are shown as the open cir- 
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Figure 10. The redshift-spacc 3PCFs of equilateral triangles for 2dFGRS galaxies (circles in the right panel) compared with model 
predictions. The left panel shows the results for mock 2dFGRS samples which use dark matter particles as galaxies. The lines with 
errorbars in the right panel are results for mock 2dFGRS samples based on the CLF model. The errorbars are 1-cr variance among 8 
independent mock samples. 



cles in the right panel of Fig. 10. The lines with errorbars in 
the right panel are results for the mock samples, with the er- 
rorbars giving the 1-cr scatter among the 8 MGRSs. Over all 
scales, the redshift-space 3PCFs obtained from our MGRSs 
are in good agreement with the observational results. We 
emphasize that this is a success of our CLF model in ac- 
counting the bias of galaxy distribution relative to the mass. 
To demonstrate this more clearly, we construct MGRSs in 
which dark matter particles are randomly chosen to rep- 
resent the 'galaxy population', with each particle assigned 
a luminosity according to the 2dFGRS luminosity function. 
Thus, the 'galaxy distribution' is un-biased in these MGRSs. 
The lines with errorbars in the left panel of Fig. 10 shows the 
normalized, redshift-space 3PCF thus obtained. This 3PCF 
is very different from that given by the mock sample based 
on the CLF, which reflects the effect of the bias in the dis- 
tribution of galaxies relative to the mass. 



GRS observations, especially on small scales (see also Jing 
& Borner 1998, 2004). 

Fig. 12 shows Q(s,u,v) for galaxies of different 
types. Note that early-type galaxies have only slightly 
higher Q(s,u,v) than late-type galaxies. The strong type- 
dependence seen in the real-space 3PCF is not seen here, 
mainly because the velocity dispersion of galaxies in individ- 
ual clusters tends to reduce the correlations on small scales. 
Once again, the Q(s,u,v) obtained from our MGRSs are in 
excellent agreement with the observations, for both early- 
and late-type galaxies. Finally, Fig. 13 shows that there is no 
significant luminosity dependence of Q(s,u, v) in the data, 
a result that, once more, matches well with our model pre- 
diction. Note that these type and luminosity dependences 
are also found in various recent works (Jing & Borner 2004; 
Kayo et al. 2004). 



Symbols in Fig. 11 shows the normalized 3PCF, 
Q(s, u, v), as a function of s, u and v obtained from the 2dF- 
GRS. Jing & Borner (2004) recently estimated the 3PCFs 
of galaxies using the 2dFGRS early data release, and their 
results for the redshift-space 3PCFs are similar to ours. The 
thick lines with errorbars are results for the mock sam- 
ples based on CLF model. Again, the redshift space 3PCFs 
Q(s,u,v) of our MGRSs are in good agreement with the 
2dFGRS observational results on all scales and for different 
triangle configurations. The thin lines in Fig. 11 correspond 
to MGRSs using dark matter particles as galaxy tracers. 
Consistent with the results shown in Fig. 10, these MGRSs 
predict 3PCFs that are in poor agreement with the 2dF- 



7 CONCLUSIONS 

In this paper, we have used the halo model combined with 
the conditional luminosity function formalism to predict the 
3PCFs for both dark matter and galaxies. These analyti- 
cal model predictions have been compared with results ob- 
tained from high-resolution iV-body simulations and from 
mock galaxy distributions constructed from these simula- 
tions. With proper assumptions, the halo model can match 
the mass 3PCF reasonably well. On small, non-linear scales 
(<; 0.5 fr _1 Mpc) the 3PCF is contributed mainly by the 1- 
halo term, and is sensitive to the concentration of the dark 
matter haloes. On intermediate scales (r ~ 2fo _1 Mpc) where 
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Figure 11. The redshift-spacc 3PCFs for 2dFGRS galaxies (sym- 
bols) compared with model predictions. The thick lines with 
errorbars are results for mock 2dFGRS samples based on the 
CLF model. The errorbars are 1-cr variance among 8 indepen- 
dent mock samples. The thin lines are results for mock samples 
which use dark matter particles as galaxies. For clarity, the results 
for u = 3.05 are shifted up by 1.0. 



both the 1-halo term and 2-halo term contribute signifi- 
cantly, the 3PCF is sensitive to abundance of the few most 
massive haloes. Due to the way in which galaxies are biased 
with respect to the mass, and due to the discreteness in the 
galaxy distribution, both effects are much weaker for galax- 
ies than for mass. Overall, galaxies have lower normalized 
3PCFs than dark matter, which is mainly due to a strongly 
suppressed 1-halo term. 

We also investigate the dependence of the 3PCF on 
galaxy type and luminosity. Since early-type galaxies reside 
preferentially in massive haloes, they have higher normalized 
real-space 3PCF than late-type galaxies. The dependence on 
galaxy luminosity, however, is found to be much weaker. 

Finally, we have compared the redshift-space 3PCF of 
galaxies predicted by current CDM model with the obser- 
vational results obtained from the 2dFGRS. We have shows 
that, with the more realistic mock samples based on the CLF 
in dark haloes, the model predictions are in good agreement 
with observations in redshift space. These results provide 
further support to the CLF model advocated by Yang et al. 
(2003) and van den Bosch et al. (2003). 
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Figure 12. The redshift space 3PCFs for early-type (left panels) 
and late-type (right panels) galaxies. The symbols are results ob- 
tained from the 2dFGRS. The lines with errorbars are results for 
mock 2dFGRS samples based on the CLF model. The errorbars 
arc 1-cr variance among 8 independent mock samples. For clarity, 
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